Refactoring by ArthurDeclercq · Pull Request #245 · CompOmics/ms2rescore

ArthurDeclercq · 2026-01-09T15:39:10Z

No description provided.


paretje · 2026-02-27T15:11:01Z

ms2rescore/feature_generators/deeplc.py

+
+            # Fit calibration and transform all predictions for this run
+            calibration = SplineTransformerCalibration()
+            calibration.fit(


_get_calibration_data doesn't raise an exception when there are no target PSMs, unlike _get_calibration_psms. However, unless I'm missing something, fit will still fail if observed_rt_calibration or predicted_rt_calibration is empty. So the case where a run has no target PSMs should probably be handled here.
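One way to handle the empty case is to check the calibration arrays before calling fit. The sketch below is only an illustration: MeanShiftCalibration is a hypothetical stand-in and not SplineTransformerCalibration, the argument order of fit is assumed because the diff is cut off at that call, and calibrate_run is an invented helper name. Whether such a run should be skipped, should raise, or should fall back to uncalibrated predictions is a decision for the PR.

```python
import logging

logger = logging.getLogger(__name__)


class MeanShiftCalibration:
    """Hypothetical stand-in for a calibration object; not the DeepLC class."""

    def __init__(self):
        self.shift = None

    def fit(self, predicted, observed):
        # Dividing by len(...) raises ZeroDivisionError on empty input,
        # standing in for whatever error a real fit would raise.
        self.shift = (sum(observed) - sum(predicted)) / len(predicted)

    def transform(self, predicted):
        return [p + self.shift for p in predicted]


def calibrate_run(run, predicted_all, predicted_cal, observed_cal,
                  make_calibration=MeanShiftCalibration):
    """Return calibrated predictions, or None when the run has no calibration data."""
    if len(predicted_cal) == 0 or len(observed_cal) == 0:
        # Assumed policy: log and skip instead of letting fit fail.
        logger.warning("Run %s has no target PSMs for calibration; skipping.", run)
        return None
    calibration = make_calibration()
    calibration.fit(predicted_cal, observed_cal)
    return calibration.transform(predicted_all)
```

The caller then decides what a None result means for that run, for example leaving the predictions uncalibrated.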

paretje · 2026-02-27T15:57:32Z

ms2rescore/feature_generators/deeplc.py

+        # Calibrate predictions per run
+        logger.info("Calibrating predicted retention times per run...")
+        for run in psm_list_df["run"].unique():
+            run_df = psm_list_df[psm_list_df["run"] == run].copy()


run_df is copied twice: once here and once in _get_calibration_data. On top of that, if it would be okay to sort psm_list_df, we could avoid copying run_df altogether and just use:

calibration_df = run_df[~run_df["is_decoy"]].head(num_calibration_psms)

I suspect that sorting all PSMs, instead of only the targets, might be faster than copying all targets when len(target_df) is considerably greater than num_calibration_psms. Of course, if psm_list_df shouldn't be sorted in place, you can probably ignore this, except for the double copy.
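The suggestion above could look roughly like the sketch below. Several things are assumptions here: the sort key (a column named score, higher is better) is not stated in the comment, is_decoy is taken to be a boolean column, and calibration_subsets is an invented helper name. Note that sort_values without inplace=True still produces one sorted copy of the whole frame; sorting in place would avoid that at the cost of reordering the caller's data.

```python
import pandas as pd


def calibration_subsets(psm_list_df, num_calibration_psms, score_col="score"):
    """Yield (run, calibration_df) pairs after a single global sort."""
    # Sort once, best score first; the input frame itself is left untouched.
    ordered = psm_list_df.sort_values(score_col, ascending=False, kind="stable")
    for run, run_df in ordered.groupby("run", sort=False):
        # Boolean indexing returns a new object, so no explicit .copy() is needed.
        calibration_df = run_df[~run_df["is_decoy"]].head(num_calibration_psms)
        yield run, calibration_df


if __name__ == "__main__":
    demo = pd.DataFrame(
        {
            "run": ["a", "a", "a", "b", "b"],
            "is_decoy": [False, True, False, False, False],
            "score": [1.0, 9.0, 5.0, 2.0, 3.0],
        }
    )
    for run, subset in calibration_subsets(demo, num_calibration_psms=1):
        print(run, subset["score"].tolist())
```

Whether this is actually faster than sorting only the targets depends on the ratio of targets to num_calibration_psms, so it would need to be profiled on real data.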

ArthurDeclercq and others added 30 commits February 24, 2024 15:48

initial commit

fdceeba

finalize ms2 feature generation

5374ed8

add rustyms

60207a3

remove exit statement fixed IM required value

ae39844

change logger.info to debug

9b98c4d

added profile decorator to get timings for functions

5e45756

removed profile as standard rescore debug statement

304777c

added new basic features

95ee475

fixes for ms2 feature generator, removed multiprocessing

73f4573

return empty list on parsing error with rustyms, removed multiprocessing

947233e

add deeplc_calibration psm set

24ce565

Merge branch 'timsRescore' of https://github.com/compomics/ms2rescore …

114b006

…into spectrum-feature-generator

remove unused import

33c38b0

Merge branch 'timsRescore' of https://github.com/compomics/ms2rescore …

40425c7

…into spectrum-feature-generator

Merge branch 'timsRescore' of https://github.com/compomics/ms2rescore …

b810b8c

…into spectrum-feature-generator

Merge tag 'main' of https://github.com/compomics/ms2rescore into spec…

69b5d1a

…trum-feature-generator

Merge pull request #177 from compomics/main

6e2d102

pull main in spectrum-feature-generator

integrate mumble into ms2branch

11fdc51

Merge remote-tracking branch 'origin/main' into spectrum-feature-gene…

3140c44

…rator

temp removal of sage features before rescoring

883169a

Merge branch 'main' of https://github.com/compomics/ms2rescore into s…

97865e7

…pectrum-feature-generator

remove psm_file features when rescoring with mumble

da39ae8

linting

37fff28

add hyperscore calculation

e8b59f3

calibration fixes

c51cd34

changes for mumble implementation

295e37f

change openms peptide formatting

909860d

add mumble psm filtering functionality

c5902c2

Merge branch 'spectrum-feature-generator' of https://github.com/compo…

6eaceb2

…mics/ms2rescore into spectrum-feature-generator

remove pyopenms dependency for hyperscore calculation

5ce55f5

ArthurDeclercq added 28 commits December 24, 2025 16:45

parsing spectra once and storing spectra objects

3091b0f

directly operate on spectra objects instead of reacquiring them

5b3d4c4

updated profiling

a3cbb1b

removed maxquant generator from fg

a1df72d

changes to column names

2c7a09b

changes to avoid out of memory error due to multiprocessing

e855abf

replace list with set to reduce lookup time to O(1)

577df19

remove unused imports

a80238b

migrate ms2 and ms2pip features to ms2rescore-rs

abf66b4

reimplement deeplc feature calculation

a9108b9

change logging

698dd5e

fix im2deep fg

862f9be

add support for continue runs and writing intermediate file on error …

889b42d

…to continue from

changes to default features sets instead of based on charge

ba930b8

minor changes

a3875da

conditional import of mumble

8e793a0

add tracking to spectrum file reading

07b96e7

change rust function names

496c0b8

minor changes to logging and other bugfixes

95e149e

add deeplc plot to plotting module

6f49935

making report generation funcitonal again

d66426e

change fg colors

c132684

remove ionmob from ms2rescore

e782616

update required python version

f011705

remove ionmob from gui

623b95f

update numpy versioning

704c22d

updata colors of report

4cce1f6

updated documentation on intermediate files

13a72b8

paretje reviewed Feb 27, 2026

ArthurDeclercq wants to merge 77 commits into main from refactoring


3 participants
